FEAT: New Audio Converters by petebryan · Pull Request #1375 · Azure/PyRIT

petebryan · 2026-02-18T03:52:59Z

Description

Added new audio convertors to add the following:

Change the speed of an audio file without altering pitch AudioSpeedConverter
Add whitenoise over an existing audio file AudioWhiteNoiseConverter
Add an echo to an existing audio file AudioEchoConverter
Adjust volume of an audio file by scaling the amplitude. AudioVolumeConverter

Added new translation convertor to allow for mid sentence language switching in a prompt MultiLanguageTranslationConverter

Distinct from RandomTranslationConverter as focused on segment level granularity and deterministic translation.

Updated AzureSpeechTextToAudioConverter to handle a situation where an audio file input is handled and just passed back out. This handles situations when using the convertors with conversation history that may include mixed audio and text Messages that would otherwise throw exceptions.

Sorry I did not raise an issue for this ahead of time, experimentation of ideas turned into code and wanted to contribute. Happy to refactor whoever is deemed best.

Tests and Documentation

Added unit tests for all convertors to test functionality and ensure audio transformations do not adversely affect audio files. (58 unit tests)
Updated AzureSpeechTextToAudioConverter tests to test for case when audio_file is provided as input after update. (1 unit test)
Updated convertor documentation .py files to reflect these updates then ran jupytext --execute --to notebook to generate notebooks.

…to pebryan_audio

pyrit/prompt_converter/audio_echo_converter.py

pyrit/prompt_converter/audio_speed_converter.py

romanlutz · 2026-02-20T13:59:30Z

pyrit/prompt_converter/azure_speech_text_to_audio_converter.py

+        if not self.input_supported(input_type):
+            raise ValueError("Input type not supported")
+
+        # If the input is already an audio path, pass it through unchanged.
+        if input_type == "audio_path":
+            return ConverterResult(output_text=prompt, output_type="audio_path")


You must be thinking of a use case I am unable to anticipate 🙂 Can you elaborate?

Sometimes you want to generate attacks that include previous turns and then add new turns on top. The problem is if those previous turns were audio, and the new turn you want to add on top is based on a text prompt. Then you have a mix of audio & text together and when you have the convertor attached to the target all prompts go through the convertor leading to it throwing an error when it tries to convert the audio_file of the previous turns. You could account for this in the notebook at run time but its easier and cleaner to have the convertor handle this and just pass through things are already audio. I couldn't really see a downside to having this in the convertor but keen to know if you can think of a problem this may cause.

romanlutz · 2026-02-20T14:03:06Z

pyrit/prompt_converter/multi_language_translation_converter.py

+logger = logging.getLogger(__name__)
+
+
+class MultiLanguageTranslationConverter(PromptConverter):


This is actually doable with the selective text converter + translation converter, see https://azure.github.io/PyRIT/code/converters/6_selectively_converting.html#example-7-applying-converters-to-different-parts

I do see the appeal of having a shortcut, though. Wdyt?

Yeah so I spent some time on this. My reasoning for having a separate convertor for this is:

Without digging into the docs a bit its not easy to see how to do this, type of splitting so its not easily discoverable

Implementing it in a notebook flow is a bit cumbersome, especially when you want to try a lot of different approaches. Having a convertor makes it cleaner and easier to implement but happy to change course if other disagree.

Baking the capability into the RandomTranslationConverter could be doable be would add a level of complexity to the convertor that I felt having a separate one made sense from a maintainability point of view but very happy to take guidance on this.

I'm wondering if we could do something like:

Wrap the splitting and chaining logic in something like SequenceLevelConverter (effectively a generalized version of WordLevelConverter)

Merge MultiLanguageTranslationConverter and RandomTranslationConverter, maybe inheriting this new SequenceLevelConverter, supporting both fixed/random language selection, and sequence/word splitting.

I would like to have @rlundeen2 chime in since he created STC. I briefly considered if this should be a shortcut to do what selective text converter does for this but it feels... not easier? Maybe because I'm already familiar with it. In any case, I don't see a case for having the implementation. At most, it should be an alias for using the selective text converter under the hood.

…tors notebook to populate new convertors.

…to pebryan_audio

hannahwestra25 · 2026-02-20T22:12:00Z

pyrit/prompt_converter/multi_language_translation_converter.py

+        logger.info(
+            "Multi-language translation complete: %d segments across languages %s",
+            len(translated_segments),
+            self.languages[: len(segments)],


nit:

Suggested change

self.languages[: len(segments)],

self.languages[: len(self.languages)],

hannahwestra25 · 2026-02-20T22:15:55Z

pyrit/prompt_converter/multi_language_translation_converter.py

+            language = self.languages[i]
+
+            system_prompt = self._prompt_template.render_template_value(languages=language)
+            conversation_id = str(uuid.uuid4())


does it matter at all that all of these are going to be part of a different conversation?

hannahwestra25 · 2026-02-20T22:18:07Z

pyrit/prompt_converter/audio_speed_converter.py

+        Raises:
+            ValueError: If speed_factor is not positive.
+        """
+        if speed_factor <= 0:


is there an upper bound ?

hannahwestra25 · 2026-02-20T22:21:35Z

pyrit/prompt_converter/audio_white_noise_converter.py

+            info = np.iinfo(data.dtype)
+            max_val = float(info.max)
+        else:
+            max_val = 1.0


should info be assigned here ? what happens on line 80 if it's not ?

Pete Bryan and others added 6 commits February 10, 2026 11:48

Added new audio convertors

106fb0b

Added more convertors

3c7ca6a

Linting fixes

a2a2296

Merge branch 'Azure:main' into pebryan_audio

34b4991

Updated docs and generated notebooks

685c20b

Merge branch 'pebryan_audio' of https://github.com/petebryan/PyRIT in…

81ac7cf

…to pebryan_audio

petebryan marked this pull request as draft February 18, 2026 03:53

Pete Bryan added 2 commits February 18, 2026 08:38

Added new convetors to api docs

b973ee4

added type parameters to bare ndarray

78f05b0

petebryan marked this pull request as ready for review February 18, 2026 17:40

petebryan changed the title ~~[DRAFT] FEAT - New Audio Convertors~~ FEAT: New Audio Convertors Feb 18, 2026

romanlutz changed the title ~~FEAT: New Audio Convertors~~ FEAT: New Audio Converters Feb 20, 2026

romanlutz reviewed Feb 20, 2026

View reviewed changes

pyrit/prompt_converter/audio_echo_converter.py Show resolved Hide resolved

romanlutz reviewed Feb 20, 2026

View reviewed changes

pyrit/prompt_converter/audio_speed_converter.py Show resolved Hide resolved

romanlutz reviewed Feb 20, 2026

View reviewed changes

romanlutz and others added 6 commits February 20, 2026 06:03

Merge branch 'main' into pebryan_audio

ef8b46f

reformatted speed convertor to break down large method. Re-ran conver…

acaa14e

…tors notebook to populate new convertors.

Merge branch 'pebryan_audio' of https://github.com/petebryan/PyRIT in…

1caa9cc

…to pebryan_audio

Merge branch 'main' into pebryan_audio

5ca316a

fixed missing Type parameters

137d1e3

Merge branch 'pebryan_audio' of https://github.com/petebryan/PyRIT in…

58d9e05

…to pebryan_audio

hannahwestra25 reviewed Feb 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

FEAT: New Audio Converters#1375

FEAT: New Audio Converters#1375
petebryan wants to merge 14 commits intoAzure:mainfrom
petebryan:pebryan_audio

petebryan commented Feb 18, 2026

Uh oh!

Uh oh!

Uh oh!

romanlutz Feb 20, 2026

Uh oh!

petebryan Feb 20, 2026

Uh oh!

romanlutz Feb 20, 2026

Uh oh!

petebryan Feb 20, 2026 •

edited

Loading

Uh oh!

fdubut Feb 20, 2026

Uh oh!

romanlutz Feb 21, 2026

Uh oh!

hannahwestra25 Feb 20, 2026

Uh oh!

hannahwestra25 Feb 20, 2026

Uh oh!

hannahwestra25 Feb 20, 2026

Uh oh!

hannahwestra25 Feb 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		logger = logging.getLogger(__name__)


		class MultiLanguageTranslationConverter(PromptConverter):

	self.languages[: len(segments)],
	self.languages[: len(self.languages)],

Comments

Conversation

petebryan commented Feb 18, 2026

Description

Tests and Documentation

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

petebryan Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

petebryan Feb 20, 2026 •

edited

Loading